A structural mixed model to shrink covariance matrices for time-course differential gene expression studies
نویسندگان
چکیده
Time-course microarray studies require a particular modelling of covariance matrices when measures are repeated on the same individuals. Taking into account the within-subject correlation in the test statistics for differential gene expression involves, however, a large number of parameters when a gene-specific approach is used, which often results in a lack of power due to the small number of individuals usually considered in microarray experiments. Shrinkage approaches can improve this detection power in differential gene expression studies by reducing the number of parameters, while offering a good flexibility and a small rate of false positives. The aim of this presentation is to propose several other F-type statistics for timecourse microarray studies based on an extension of the structural mixed model presented by [1] to the multivariate case. We will focus on two main decompositions of the empirical gene-bygene covariance matrices: eigenvalue/eigenvector and Cholesky decomposition. A structural mixed model will be used in three configurations to shrink i) the eigenvalues, ii) the innovation variances, iii) both the innovation variances and antedependence parameters. The F-statistics based on these shrunk covariance matrices were compared to a gene-by-gene analysis, an homogeneous covariance model and the modified Limma F-test both on simulated and real data sets. The proposed methods were found to perform well compared to other empirical Bayesian approaches, and outperformed the gene-specific or common-covariance methods in many cases. Computing the denominator degrees of freedom of these F-statistics under H0 still remains, however, an open question.
منابع مشابه
Differential Expression of Alpha S1 Casein and Beta-Lactoglobulin Genes at Different Physiological stages of the Adani Goats Mammary Glands
Background: Milk proteins genes have been the focus of the researches as the candidate target genes that play a decisive role when animal breeding is desired.Objectives: In the present study, the transcriptional levels of Beta-lactoglobulin (BLG) and Alpha S1 casein (CSN1S1) genes were investigated during prenatal, milking and drying times in mammary glands of the Adani goats which showed...
متن کاملModel-Based Clustering of Longitudinal Data: Application to Modeling Disease Course and Gene Expression Trajectories
6 We consider the problem of clustering time dependent data. The model is 7 a mixture of regressions, with variance-covariance matrices that are allowed 8 to vary within the extended linear mixed model family. We discuss appli9 cations to biomedical data and analyze two longitudinal data sets: one on 10 patients with delirium, and the other on mosquito gene expression following 11 infection. 12
متن کاملTuning of Extended Kalman Filter using Self-adaptive Differential Evolution Algorithm for Sensorless Permanent Magnet Synchronous Motor Drive
In this paper, a novel method based on a combination of Extended Kalman Filter (EKF) with Self-adaptive Differential Evolution (SaDE) algorithm to estimate rotor position, speed and machine states for a Permanent Magnet Synchronous Motor (PMSM) is proposed. In the proposed method, as a first step SaDE algorithm is used to tune the noise covariance matrices of state noise and measurement noise i...
متن کاملStructure of Wavelet Covariance Matrices and Bayesian Wavelet Estimation of Autoregressive Moving Average Model with Long Memory Parameter’s
In the process of exploring and recognizing of statistical communities, the analysis of data obtained from these communities is considered essential. One of appropriate methods for data analysis is the structural study of the function fitting by these data. Wavelet transformation is one of the most powerful tool in analysis of these functions and structure of wavelet coefficients are very impor...
متن کاملInformation Covariance Matrices for Multivariate Burr III and Logistic Distributions
Main result of this paper is to derive the exact analytical expressions of information and covariance matrices for multivariate Burr III and logistic distributions. These distributions arise as tractable parametric models in price and income distributions, reliability, economics, Human population, some biological organisms to model agricultural population data and survival data. We showed that ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Computational Statistics & Data Analysis
دوره 53 شماره
صفحات -
تاریخ انتشار 2009